A new multi-label dataset for Web attacks CAPEC classification using machine learning techniques
نویسندگان
چکیده
There are many datasets for training and evaluating models to detect web attacks, labeling each request as normal or attack. Web attack protection tools must provide additional information on the type of detected, in a clear simple way. This paper presents new multi-label dataset classifying attacks based CAPEC classification, way features extraction ASCII values, evaluation several combinations algorithms. Using extract by computing average sum values characters field that compose request, algorithms (LightGBM CatBoost) classification evaluated, complete system is suffering. The test data used come from SR-BH 2020 dataset. Calculating different make up shows its usefulness numeric encoding feature extraction. allows models, also allowing various undergoing. combination two-phase model with MultiOutputClassifier module scikit-learn library, together CatBoost algorithm superiority criticality scenarios. Experimental results indicate machine learning multi-phase leads improved prediction attacks. Also, use suitable about
منابع مشابه
Comparison of Machine Learning Techniques for Multi-label Genre Classification
We compare classic text classification techniques with more recent machine learning techniques and introduce a novel architecture that outperforms many state-of-the-art approaches. These techniques are evaluated on a new multi-label classification task, where the task is to predict the genre of a movie based on its subtitle. We show that pre-trained word embeddings contain ’universal’ features ...
متن کاملExtreme Learning Machine for Multi-Label Classification
Xia Sun 1,*, Jingting Xu 1, Changmeng Jiang 1, Jun Feng 1, Su-Shing Chen 2 and Feijuan He 3 1 School of Information Science and Technology, Northwest University, Xi’an 710069, China; [email protected] (J.X.); [email protected] (C.J.); [email protected] (J.F.) 2 Computer Information Science and Engineering, University of Florida, Gainesville, FL 32608, USA; [email protected] 3 Department o...
متن کاملMulti-label ASRS Dataset Classification Using Semi Supervised Subspace Clustering
There has been a lot of research targeting text classification. Many of them focus on a particular characteristic of text data multi-labelity. This arises due to the fact that a document may be associated with multiple classes at the same time. The consequence of such a characteristic is the low performance of traditional binary or multi-class classification techniques on multi-label text data....
متن کاملClassification on Multi-label Dataset Using Rule Mining Technique
Most recent work has been focused on associative classification technique. Most research work of classification has been done on single label data. But it is not appropriate for some real world application like scene classification, bioinformatics, and text categorization. So that here we proposed multi label classification to solve the issues arise in single label classification. That is very ...
متن کاملTechniques for Traffic Sign Classification Using Machine Learning-a Survey
The Road Sign Recognition is a field of applied computer vision research concerned with the automatic detection and classification of traffic signs in traffic scene images. The aim of this research paper is to study the various classification techniques that can be used to construct a system that recognizes road signs in images. The primary objective is to develop an algorithm which will identi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computers & Security
سال: 2022
ISSN: ['0167-4048', '1872-6208']
DOI: https://doi.org/10.1016/j.cose.2022.102788